Error Correlation and Error Reduction in Ensemble Classifiers
نویسندگان
چکیده
Using an ensemble of classi ers, instead of a single classi er, can lead to improved generalization. The gains obtained by combining however, are often a ected more by the selection of what is presented to the combiner, than by the actual combining method that is chosen. In this paper we focus on data selection and classi er training methods, in order to \prepare" classi ers for combining. We review a combining framework for classi cation problems that quanti es the need for reducing the correlation among individual classi ers. Then, we discuss several methods that make the classi ers in an ensemble more complementary. Experimental results are provided to illustrate the bene ts and pitfalls of reducing the correlation among classi ers, especially when the training data is in limited supply.
منابع مشابه
Application of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
متن کاملDiversity in
Diversity in ensembles is the key to improved accuracy. Six pair wise diversity measures have been studied and Plain Disagreement and Kappa coefficient are being recommended for ensemble construction as they exhibit high correlation with error reduction. We have also verified that correlation between diversity and error reduction increases with an increase in the ensemble size. This study also ...
متن کاملVariance reduction trends on 'boosted' classifiers
Ensemble classification techniques such as bagging, (Breiman, 1996a), boosting (Freund & Schapire, 1997) and arcing algorithms (Breiman, 1997) have received much attention in recent literature. Such techniques have been shown to lead to reduced classification error on unseen cases. Even when the ensemble is trained well beyond zero training set error, the ensemble continues to exhibit improved ...
متن کاملEnsemble classifiers based on correlation analysis for DNA microarray classification
Since accurate classification of DNA microarray is a very important issue for the treatment of cancer, it is more desirable to make a decision by combining the results of various expert classifiers rather than by depending on the result of only one classifier. In spite of the many advantages of mutually error-correlated ensemble classifiers, they are limited in performance. It is difficult to c...
متن کاملOn Error Correlation and Accuracy of Nearest Neighbor Ensemble Classifiers
Recent empirical work has shown that combining predictors can lead to significant reduction in generalization error. Unfortunately, many combining methods do not improve nearest neighbor (NN) classifiers at all. This is because NN methods are very robust with respect to variations of a data set. In contrast, they are sensitive to input features. We exploit the instability of NN classifiers with...
متن کاملFault Detection of Bearings Using a Rule-based Classifier Ensemble and Genetic Algorithm
This paper proposes a reduct construction method based on discernibility matrix simplification. The method works with genetic algorithm. To identify potential problems and prevent complete failure of bearings, a new method based on rule-based classifier ensemble is presented. Genetic algorithm is used for feature reduction. The generated rules of the reducts are used to build the candidate base...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Connect. Sci.
دوره 8 شماره
صفحات -
تاریخ انتشار 1996